Signature Files: An Integrated Access Method for Formatted and Unformatted Databases

Authors

  • Deniz Aktug
  • Fazli Can
Abstract

The signature file approach is one of the most powerful information storage and retrieval techniques which is used for finding the data objects that are relevant to the user queries. The main idea of all signature-based schemes is to reflect the essence of the data items into bit patterns (descriptors or signatures) and store them in a separate file which acts as a filter to eliminate the non-qualifying data items for an information request. It provides an integrated access method for both formatted and unformatted databases. A comparative overview and discussion of the proposed signature generation methods and the major signature file organization schemes are presented. Applications of the signature techniques to formatted and unformatted databases, single and multiterm query cases, serial and parallel architectures, static and dynamic environments are provided, with a special emphasis on the multimedia databases where the pioneering prototype systems using signatures yield highly encouraging results.
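The filtering idea described in the abstract can be illustrated with a minimal sketch. It assumes superimposed coding as the signature generation method (one of several possible methods; the abstract does not single one out). The names `SIG_BITS`, `BITS_PER_WORD`, `word_signature`, `record_signature`, `build_signature_file`, and `search` are hypothetical, and the parameter values are arbitrary choices for illustration, not values taken from the paper.

```python
import hashlib

SIG_BITS = 64       # signature width in bits (assumed value for illustration)
BITS_PER_WORD = 3   # bit positions set per word (assumed value for illustration)


def word_signature(word: str) -> int:
    """Map a word to a bit pattern with at most BITS_PER_WORD bits set."""
    sig = 0
    for i in range(BITS_PER_WORD):
        digest = hashlib.sha256(f"{i}:{word.lower()}".encode("utf-8")).digest()
        position = int.from_bytes(digest[:4], "big") % SIG_BITS
        sig |= 1 << position
    return sig


def record_signature(text: str) -> int:
    """Superimpose (bitwise OR) the signatures of all words in a record."""
    sig = 0
    for word in text.split():
        sig |= word_signature(word)
    return sig


def build_signature_file(records):
    """The 'separate file': one signature per record, in record order."""
    return [record_signature(r) for r in records]


def search(query, records, signatures):
    """Return indices of records that contain every query word."""
    query_sig = record_signature(query)
    query_words = set(query.lower().split())
    hits = []
    for idx, sig in enumerate(signatures):
        # Filter step: a record whose signature lacks any query bit
        # cannot contain all query words, so it is skipped unread.
        if sig & query_sig != query_sig:
            continue
        # Verification step: a matching signature may still be a false
        # drop, so the record itself is checked before it is reported.
        if query_words <= set(records[idx].lower().split()):
            hits.append(idx)
    return hits


records = [
    "signature files filter records",
    "inverted index for text",
    "bit patterns act as a filter",
]
signatures = build_signature_file(records)
matches = search("filter", records, signatures)
```

The signature comparison can only rule records out. A record that passes the filter is still read and verified, which is why a signature file is described as a filter rather than an exact index.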

Similar resources

The HOOKAH Information Extraction System

The focus of Project HOOKAH is to improve the processing of the DEA-6 report, a semi-formatted report generated primarily by field agents, as well as legal staff, analysts, and others. DEA-6s are organized into case files, and are composed of multiple sections with varying amounts of formatting. Header fields are normally highly formatted, and indicate the subject, case, date, time, etc. There ...

Design of a Signature File Method that Accounts for Non-Uniform Occurrence and Query Frequencies

In this paper we study a variation of the signature file access method for text and attribute retrieval. According to this method, the documents (or records) are stored sequentially in the “text file”. Abstractions (“signatures”) of the documents (or records) are stored in the “signature file”. The latter serves as a filter on retrieval: It helps discarding a large number of non-qualifying documen...

A Superimposed Coding Scheme Based on Multiple Block Descriptor Files for Indexing Very Large Data Bases

A new signature file method for accessing information from large data files containing both formatted and free text data is presented. The new method, called the multiorganizational scheme, is proposed for indexing very large data files containing hundreds of thousands or possibly millions of records.

An integrated genetic data environment (GDE)-based LINUX interface for analysis of HIV-1 and other microbial sequences

MOTIVATION: Sequence databases encode a wealth of information needed to develop improved vaccination and treatment strategies for the control of HIV and other important pathogens. To facilitate effective utilization of these datasets, we developed a user-friendly GDE-based LINUX interface that reduces input/output file formatting. DESIGN AND RESULTS: GDE was adapted to the Linux operating syste...

A Method for Protecting Access Pattern in Outsourced Data

Protecting the information access pattern, which means preventing the disclosure of data and structural details of databases, is very important in working with data, especially in the cases of outsourced databases and databases with Internet access. The protection of the information access pattern indicates that mere data confidentiality is not sufficient and the privacy of queries and accesses...

Publication date: 2008